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ABSTRACT 


This paper is an extension of earlier work (Part I) whicn provided practical adap- 
tive techniques for the efficient noiselvss coding of a broad class of data sources 
characterized by only partially known and varying statistics (JPL Publication 79-22). 
The results here, while still claiming suc^ general applicability, focus primarily on the 
noiseless coding of image data. A fairly complete and self-contained treatment is provid- 
ed. Particular emphasis is gl^en to the requirements of the forthcoming Voyager II en- 
counters of Uranus and Neptune. Performance evaluations are supported both 
graphically and pictorially. 

Expanded definitions of the algorithms in Part I yield a computationally improved 
set of c )tions for applications requiring efficient performance at entropies above 4 
bits/sample. These expanded definitions include as an important subset, a somewhat 
less efficient but extremely simple “FAST” compressor which will be used at the 
Voyager Uranus encounter. Additionally, options are provided which enhance perfor- 
mance when atypical data spikes may be present. 
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I. INTRODUCTION 


References 1 -3 provided the development and analysis of some practical adap- 
tive techniques for the efficient noiseless coding of a broad class of data sources. 
Specifically, these algorithms were developed for efficiently coding discrete memory- 
less sources which have known symbol probability ordering but unknown probability 
values. A general applicability of these algorithms to solving practical problems is ob- 
tained because most real data sources can be simply transformed into this form by ap- 
propriate reversible preprocessing. Application to image data compression is a particu- 
larly important and straightforward example, having motivated most of '.iris earlier 
work.f^l'f®! 


This paper also derives its motivation from an image data compression problem 
arising from the combination of severe limitations in both on-bnard processing and 
available data rate of the Voyager II spacecraft which will encounter Uranus in 1 986 
and hopefully, Neptune in 1989. In addition to providing generalizations of the source 
independent algorithms of Refs. 1 -3, this paper explicitly deals with the preprocessing 
requirements of monochrome image data sources, yielding a fairly complete treatment 
of noiseless image compression. The techniques presented span a broad range of perfor- 
mance and complexity. New results include the definition and evaluation of: 

a) an extremely simple “FAST" compressor for the Uranus encounter; 

b) algorithms which maintain efficient performance while reducing 
computation; 

c) algorithms which improve performance; 

d) algorithms to deal with atypical data spikes. 

The material presented here is mostly self-contained although the reader may 
v/ish to review the additional concepts and background supplied by the references (e.g., 
explicit coding examples in Ref. 1 ). Tne notaxional convention and method of presenta- 
tion developed in Ref. 1 will be followed although it will be reviewed as needed hero. 
Although considerable attention is given to the Voyager problem, the reader should keep 
in mind the general applicability of these techniques to other imaging and non-imaging 
problems. 
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BACKGROUND AND REVIEW 


Discrete data sources arising from practical problems are generally characterized 
by only partially known and varying statistics. As just noted, the algorithms in Refs. I -3 
were developed for efficiently coding discrete memoryless sources which have known 
symbol probability ordering but unknown probability values. A genera! applicability of 
these algorithms to solving practical problems is obtai.ied because most real data 
sources can be simply transformed into this form by appropriate reversible preprocess- 
ing. The latter operations include those that first 'emove correlation (memory) to pro- 
duce samples which are approximately independent and then relabels these into thi. in- 
tegers 0, 1 , 2, ... such that a smaller integer is more likely to occur than a larger one. 
That is, if pj denotes the probability of integer i, then the desired condition resulting from 
preprocessing is 


Po-Pl-P2- -- 

The reversible preprocessing step yielding (1) is followed by a mapping of the 
integers into variable length codewords. Compression is obtained on an average basis 
by using shorter codevyords for the most frequently occurring integers (by (1), the 
smaller ones) and longer codewords for the less likely integers (the larger ones). It is the 
mechanism for assignment of codewords that Refs. 1 -3 primarily address. Here, we will 
in addition, treat the reversible preprocessing function for image data sources. 

Image Preprocessor 

In some problems a preprocessor may need to be adaptive to ensure that the de- 
correlation process is maintained and that condition (1) remains well approximated. But 
for the imaging problem, such as Voyager's, the preprocessor can usually be fixed and 
quite simple. 

The first step in any application is to fully utilize any source memory or a priori in- 
formation. For imaging this amounts to the generation of a sequence o1 "differences" 
from predicted picture element (pixel) values. The better that prediction process per- 
forms, the less uncertainty there is in the sequence of output symbols. This itself leads 
to fewer bits in the coding process to follow. However, using any more than the im- 
mediately adjacent pixels by a predictor offers negligible improvement in performance. 
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Thus practical one and two-dimensional predictors reduce to simple functions of the pix- 
els immediately before and/or above a predicted pixel. Letting Xj j denote the jth pixel 
value on the ith line, the candidate practical predictors, x: :, for x: : are summarized in 
Table l .t 


Table 1. Practical image Predictors. 


ONE-DIMENSIONAL 

TWO-DIMENSIONAL* 

X. . = X. . ^ 
irj 

X. . , -1- X. , . 
/s l,J'1 1-1, J 

X. . = 

M 2 

* use roundoff or truncation 


The approximately memorvless erroi signal, henceforth denoted as A, from either 
predictor is consistently distributed in a unimodal fashion about zero for virtually any 
scene activity. This leads to the condition 

Pr[A = 0] > Pr[A = -r 1 ) > Pr[A = - 1 ] > Pr[A = + 2J > . , . (2) 

Not surprisingly, low detail scenes yield distributions more peaked around zero than high 
detail scenes because the prediction works better. Similarly, the distributions will 
generally be more peaked for the two-dimensional predictor although this is only signifi- 
cant on detailed scenes. 

Because of the consistent shape of the error signal. A, the desired basic mapping 
ir *) the integers, S, such that the probability ordering of condition (1 ) is almost always 
welt approximated, is given by Table 2 and expressed analytically as^ 


f Observe that the performance of these very simple predictors would be noticeably af- 
fected by "sensoi noise" (e.g., the familiar gain and offset variations of today's CCD 
cameras) but these variations can be considered negligible for the Voyager system. 

^Observe that this statement remains true even if one or more pixel least significant 
bits are "shifted out" before data entry. The distributions of both A and S would simply 
become more peaked at zero by this procedure. 
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Table 2. Basic Mapping of \ into the Integers, d. 


Prediction 

Error 

A 

Integer 

5 

0 

0 

-H 1 

1 

- 1 

2 

+ 2 

3 

-2 

4 

-^3 

1 . 

5 


( 2A - 1 if A > 0 

6 =. 

( 2|A| if A < 0. 


(3) 


This simple mapping is adequate for most imaging situations, however, it does 
not utilize a dynamic range constraint provided by predictor values Xj j in Table 1 . The 
following modification to this basic mapping offers advantages when a significant por- 
tion of the data values occur near the boundaries of the dynamic range. 

For simplicity let "x denote the prediction of a pixel which takes on the value x. The 
prediction error is then 


= X - X . 


(4) 


Because the possible values of x and x are constrained to the range [0, so are 

the values of A. In fact 


^ A 

■ X < A < x 


MAX 


- X. 


(5) 
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Thus the inequalities in (2) are only good approximations out to the limits specified in 
(5). Outside this range the basic mapping of A to 5 assigns numbers to events which 
can't happen, thus violating the desired goals in (1). Afte** some manipulation a modifi- 
cation to (3) which takes advantage of these observations can be derived. The result is 
given as 




1 

< 

CM 

if 

0 < A i.x 

X 

if 

A > X 

2iAl 

if 

X - x...^<AsO 



MAX 

X. . . .. - X 

if 

X — X. . ^ > A. 

MAX 


MAX 


( 6 ) 


Image Noiseless Coder Structure 

The basic overall image noiseless coder structure is illustrated in Tig. 1 where he 
variable length coding operations which follow the reversible preprocessing have been 
denoted collectively as ^(*1 J Note that an initial reference pixel needs to be sent to 
assure complete reversibility from a sequence of differences. 


Practical Performance Measure 

Let and P^ be the probability distributions for the occurrence of A and 6 
samples in Fig. 1 . By the one-to-one mapping in Table 1 , P^ and P^ are equivalent. Then 
the ei^tropy of distributions is given by 


H 


A 



E 


Pj log^ p. bits/sample 


(7) 


^That is, ^1*1 denotes the reversible operations that map an input sequence X into 
coded sequence Subsequent discussion will subscript and superscript \i> to de- 
note different coding algorithms. 
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Fig. 1 . Basic Image Noiseless Coding Structure. 

where the | p- } are the probability values making up and Pg. Entropies arising from 
the two predictors in Table 1 are generally called one and two-dimensional different!^ 
entropies respectively. Since Hg is really an average measure of uncertainty in 5 
samples, the two-dimensional predictor generally v:«lds a lower entropy. 

The entropy, Hg, provides a practical measure of performance for code operators 
^l.) regardless of how the sequences, 6. originated. However, for the imaging applica- 
tion Hg (which squals H^) also provides a practical measure of overall “image noiseless 
coder'' performance. Specifically, Hg represents a bound to the best performance 
(bits/sample, bits/pixel) of any coding algorithm which treats the 6's as an independent, 
stationary sequence of samples. It is not an absolute bound to performance unless the 
latter conditions are true. In the imaging application the independence condition is cer- 
tainly well approximated by the predictive decorrelation but the obvious fluctuations In 
scene activity preclude stationarity. This just means that performance under a 
"measured" Hg may be possible under some conditions if the coding algorithms, \p[»], 
did not rely on stationarity. 
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Quite obviously, there are both long term and short term variations in image activ- 
ity so that the value of a measured depends much on the span of data over which it is 
measured. In this paper we will deal only with entropies of distributions derived from 
complete individual images. Performance close to such measured entropies can gener- 
ally be considered a good guide to efficient performance. The key practical problem in 
the specification of is to assure that such efficient performance can be expected for 
all images encountered. In Voyager's case must also meet severe implementation 
constraints. 

Observe from (7) that for imaging applications there is a one-to-one correspon- 
dence between performance in bits per 8 sample and bits per input pixel. However, 6 se- 
quences with the same characteristics could have originated from sources other than 
imaging. Hence to emphasize this broader applicability, results will primarily be noted in 
bits/sample. We will switch to bits/pixel (b/p) when referring to specific imaging results. 


Coder Operation at Fixed Line Rates 

Another restriction placed on the operation of a Voyager image noiseless coder is 
that it must operate at pre-established line rates (bits/line). But as already noted, data 
entropy varies and consequently the number of bits used by an efficient noiseless coder 
will also vary, from line-to-line and image to image. The primary mode to satisfy these 
format constraints will be to simply truncate a line when its allotted bits have been u.sed 
up.^ Special modes are being considered to more intelligently edit data when an object 
of interest does not fill a field of view. However, most of the investigations of coding ef- 
ficiency here will deal with complete images. 


COMMUNICATION ERROR EFFECTS 

The effect of communication errors on data compressed by the noiseless coding 
algorithms treated here is significantly more dramatic than on uncompressed imaging. 
Instead of affecting only a single pixel, an error may confuse a noiseless decompressor 
so that succeeding pixels may be wrong until the system is reinitialized (e.g., once per 


^Ref. 7 describes a rate controlled algorithm developed for the Galileo mission which 
adaptively changes local quantization to avoid truncation of lines. 
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line). These effects can be countered to a certain extent by incorporating more frequent 
updates and using a more sophisticated decompressor that finds errors by looking for 
the start of anomalous data. However, we will ignore these possibilities here since the 
primary communication mode at Uranus and Neptune should be “virtually error free". 
This is provided by a sophisticated concatenated channel incorporating an outer Reed- 
Solomon block code (symbol size J = 8, error-correction E = 1 6) coupled with the well 
known convolutionally coded-Viterbi decoded inner channel. References 8 and 9 pro- 
vide discussions on the system implications of this channel for which the framework of 
international standardization has now been established. Numerous papers further 
treating performance and nplementation for space applications are cited in Ref. 8. 
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II. BASIC ADAPTIVE VARIABLE LENGTH CODING 


The remaining problem in the definition of noiseless coders which represent 
images efficiently is the specification and performance evaluation of the operations de- 
noted by \^(*1 in Fig. 1 which code preprocessed ^sequences efficiently. This section 
will motivate the definition of new, computationally improved algorithms by introducing 
some basic coding operations, definitions and notation originating in Ref. 1 . Subse- 
quently, notation will be extended to include b*th the new and old algorithms under one 
definition. The performance of several alternatives will be graphically compared using a 
broad test set of images. Unless noted otheiwise, all perforn^ance results noted subse- 
quently assume image preprocessing based on the one-dimensional prediction in Table 1 
and the standard 6 mapping in (3). Results for variations to this approach will be ex- 
plicitly discussed in Section VI. 

SOME NOTATION CONVENTIONS 

Concatenation 

If X and Y are two sequences of samples then we can form a new sequence Z by 
running them back to back as 


Z = X * Y (8) 

using the asterisk as our basic indication of concatenation. However, the * will be omit- 
ted occasionally when no confusion should result. 

Length of a sequence. Any sequence of non-binary samples can be represented 
by a sequence of bits using the familiar binary representations which use a fixed number 
of bits. Without any anticipated confusion, the function .'?’(•) will be used to specify the 
length of any sequence in samples or bits (of its standard binary representation) as re- 
quired. For example, if X is a sequence of J samples requiring m bits/ sample to represent 
we may take 


mx) 


(S) 


as J samples or mJ bits. Of course there is only one possibility if X is already a binary 
sequence. 
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FUNDAMENTAL SEQUENCE 


PE PC'OA ■ 


Consider first an extremely simple variable length coding operation which will 
become central in later developments. Define the code word function fs[»] by 

i zeroes 

fslil s 0 0 0 . . . 0 0 1 ;i0) 

where i > 0 is an input integer. The length of “codeword" fs[i] is 

(. = 5^(fs[il) = i+1 bits. (11) 


Let 


X = Xi X 2 . . . Xj (1 *>) 

denote a sequence of samples meeting the conditions of preprocessing described ear- 
lier. Then, the coding of X using fs(»I on each sample yields the fundamental sequence 
of X 


^.,(X1 = FSIX] s fs(x^) * fslx^J * ... * fs[Xj]. (13) 

Th?t is, [•] or FShl denote the operations of applying (1 0) to each symbol of a 
sequence. The length of a fundamental sequence is 


J 

Fq = 4?(FS[X1) = J -I- X.. (14) 

j=1 ' 


Observations 

Because of the assumed probability ordering of input symbols in ( 1 ) and the code- 
word lengths in (1 1 ), shorter codewords will be used more often than longer ones. Note 
that the definition in (10) does not depend on alphabet size. 
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A plot of the typical average per sample performance of code operator I = FSl») is 
shown in Fig, 2 as a function of measured data entropy . The graph was derived from 
the results of preprocessing many forms of data such that condition (1) was well 
approxim ted. 

Note that performance is efficient (close to the entropy) in the range of roughly 
1 .5 to 3.0 bits/sample, but rapidly becomes inefficient outside this range. Unfortunate- 
ly, Voyager Uranus and Neptune encounters may generate some images with entropies 
exceeding 4 b/p although the majority are expected to be at much lower values. Other 
applications can have much broader swings in data entropy. Thus in general code 
operator 1*1 = FSl«) is inadequate by itself. Additional code options are needed to ex- 
tend the range of efficient performance. 



Fig. 2. Average FS Performance. 
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UNITY CODE OPERATOR 






A trivial code option is obtained by defining 


^giX] (15) 

as an> fixed length binary representation of X. In the simplest case we can take ^g[X] as 
X itself so that 


^glXl = X. (16) 

However, in some applications such as imaging it may be advantageous to take ^g[X] as 
a fixed length binary representation before reversible preprocessing, can be inter- 
preted as a "backup" code operation to be used when all else fails. Obviously, the per- 
formance of ^gl*] is independent of entropy. 

OPERATORS AND 

Code operators ^q[»] and known as "code fs bar" and "code fs") of- 

fer efficient performance in entropy ranges below 1.5 bits/sample and above 3.0 
bits/sample respectively as illustrated in Fig. 3. They are defined in Ref. 1 . 



Fig. 3. Average Performance, 4^qI*], and vi'gM- 
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quality 

THE BASIC COMPRESSOR, 

An adaptive coder called the Basic Compressors^ can be defined by choosing 
betweeri these four options as 

^^IX] = BCJX) = ID * ^,p[Xl (17) 

where the concatenated ID is assumed to be a 2-bit binary number whereas, as a 
subscript to ^ it takes on the values 0, 1 , 2 or 3. 

The most straightforward, and *n fact optimum, selection procedure is to simply 
choose the code operator output sequence which is shortest. Other simplified proce- 
dures discussed in Ref. 1 provide nearly identical results. 

Performance 

The average performance of the Basic Compressor, used on a broad set of 
8 b/p Voyager images (from Saturn encounter) and other selected data is shown in 
Fig. 4 where the Voyager data points are indicated by the symbol x. One-dimensional 
prediction (Table 1 ) and the standard mapping (Table 2) is assumed. The impact of two- 
dimensional prediction and the modified mapping in (6) will be noted later in Section VI. 
The portion of the image test set derived from Voyager images is shown in Fig. 5. 
Subsequent performance graphs using this same test set will omit the data points 
although the anticipated Voyager entropy range will continue to be specified. 
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0.0 1.0 2.0 3.0 4.0 5.0 6.0 

ONE-DIMtNSIONAL ENTROPY, 


Fig. 4. Average Performance, 
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III. CODE STRUCTURE FOR HIGH ENTROPIES 

^ 2 ^*^ gives the Basic Compressor efficient performance out to about 4 b/p, where 
the performance curve begins to move away from tfie entropy line. For Voyager, this 
covers the entropy range expected for most images. However, in some cases, one- 
dimensional image entropies might be as high as 5 b/p and, at a more local line-by-line 
level, even higher. 

References 1-3 define additional code operators, and which combine 
"split-sample" modes with or alone to extend efficient performance to any 
higher entropy range. However, difficulties in implementing v^'21*] under Voyager flight 
data system computation constraints led us to investigate combinations of the split- 
sample modes with the simpler fundamental sequence operator, [•]. To this end we 
will first develop an appropriate code structure and then investigate performance impli- 
cations of various options. Previously defined algorithms in Refs. 1-3 will fit within this 
code structure as will the extremely simple "FAST Compressor" destined for the 
Voyager Uranus encounter. Later sections will fi . ther extend these split-sample mode 
definitions. 

BASIC SPLIT-SAMPLE OPERATIONS 

Following Ref. 1, let Mq be a sequence of N samples, represented by n bits/ 
sample using a standard binary representation (a sign bit, if present, is assumed to be 
the most significant bit). Define the basic split-sample operator SSq by 


v/here 


SS^ [M 


"i = |l° 

0 I k 0 ? 


0 


(18) 



n,k 

A,0 



] = 


N sample 
sequence made 
up of the k 
least significant 
bits of each 
Mq sample 


(19) 
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and Mq ^ is simply all the remaining most significant bits of each sample after removing 
the first k. By this definition the parameter n is really not crucial. However, its inclusion 
is useful in keeping track of more complex operations later. Thus we define 


! N sample sequence 
made up of the 
n-k most significant 
bits of each 
Mq sample 


( 20 ) 


and where SSq'^IMq] = Mq, These operations are illustrated in Fig. 6.'’' 

Alternative Structures for Imaging 

For imaging applications the split-sample operations can be used for coding pur- 
poses using the two different arrangements in Fig. 7. 


The structure in the upper portion (Structure A) assumes that the predictive pre- 
processing described in earlier sections precedes the split-sample operator SSq ^[*], 
forming Mq lequivalent to the sequence 6 in Fig. 1). Operator SSq*^[»] then generates 
sequences L® and Mq'*^ as defined in (1 9) and (20). The latter sequences are passed on 
for further coding to be discussed momentarily. 


^If m is an individual n bit sample of Mq we can write 

„, = b .2"-’ + ... + b,.2'< + b, . 2 ’'-’ 
n- 1 k k- 1 


... bg . 2 


= 2 


n-k-1 

L 

j=o 


b. . , • 2 M + ^ b. . 2^ 


k + j 


j = 0 


J 


2 ^M " 


where </>|^ and </>|_ are elements of Mq and respectively. 
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J 



Fig. 6. Basic Split-Sample Operator, SS 


n,k 

0 


[• 1 . 


By contrast the arrangement in the lower part of the diagram (Structure B) 
assumes that the split of least and most significant bits occurs first, followed by image 
reversible preprocessing. Using script letters to accentuate the difference, we let<y^ be 
the original n b/p image data which is applied directly to SSq'*^I» 1. Following the same 
definitions in (19) and (20), we take the output of SSq'*^[«] as anu^Q*^ respec- 
tively. We tfien define 



and 



as the result of image reversible preprocessing operations on...^ 


n,k 
0 • 


( 21 ) 


( 22 ) 
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STRUCTURE A 


(RANDOM) 



MORE PEAKED 
LOWER ENTROPY 
FOR L^^RGER k 



ROUGHLY 

EQUIVALENT 




Fig. 7. Alternative Sp'it-Sample Structure for Image Data. 


Structure A or Structure B. Tha statistical characteristics of l 9 and 


:^0 


*'*^n k 

roughly oquivaient to those of and . Thus, for image data sources, any coding 

algoritnms specified using l 9 and will yield very nearly the same average perfor- 
^'*0 n k ^ ^ 

rnance if Lj^ and Mq' a^e substituted. Specifically, numerous compa.’sons indicate 
that, while close. Structure A can offer an advantage of as much as 0.1 b/p over Struc- 
ture B (tne largest advantages occurring at entropies above 4 b/p). This difference may 
be small enough in some applications for an implementer to choose one approach over 
the other based solely on hardware and software considerations. However, keeping this 
tradeoff in mind, we will henceforth assume Structure A when quoting results for image 
data. Structure A is in fact more appealing since it maintains the original arrangement 
established in Fig. 1 where image preprocessing functions are completely separated 
from variable length coding functions. We again need only define and evaluate algo- 
rithms for coding preprocessed Mq sequences (6 in Fig. 1 ). Results are not dependent on 
the source of Mq sequences and are thus broadly applicable to non-imaging sources 
which can be appropriately preprocessed. 
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Preprocessing Assumption. Until Section VI, all stated results will additionally 
assume that the image preprocessing which generates Mq sequences is limited to the 
one-dimensional prediction of Table 1 and the standard mapping of erior signals into in- 
tegers given by (3). 

Motivation for Split-Sample Coding 


Now focussing on Structure A, since Mq sequences are the result of appropriate 
image preprocessing, sample distributions will exhibit the desired characteristics speci- 
fied by ( 1 ). But by assumption the entropy of Mq sequences is too high to be coded effi- 
ciently using algorithms specified in earlier sections. However, as k is increased, the 
Mq'^ sequences continue to retain the desired ordering at decreasing entropy values 
(more peaked distributions). At some value of k the entropy of Mq'^ sequences will drop 
low enough to lie within the efficient operating range of operators ^q(» 1, ^-| [•] or 
suggesting an efficient means for coding that portion of the data. 

An appropriate means for coding the remaining sequences is trivial. Until the 
decreasing entropy of Mq' sequences reaches about 3 bits/sample, the least signifi- 
cant bits appear totally rand'^m and are therefore already optimally coded. Consolidating 
these observations leads to a set of split-sample code operators, 

OPERATOR |^[.] 

In the following discussions we will modify the original definition of split-sample 
coding to allow tor additional generality. Split-sample code operator . [•] is defined 
byt 


= '■k • '• 

Equation (23) means that coding N sample sequence Mq by vi-j is accomplished by 
using to code the (n-k) most significant bit samples and prefixing the result with a 
sequence of all the k least significant bits.^ The coding structure for . [•] is given in 

l/K 

Fig. 8. 


^Observe that iZ-j will later be generalized to ^ |^,[») with equivalence when k' = 
n-k. We will continue with the simpler notation for now. 

can be placed before or after ^jlMQ*^]. 
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Special Cases^^®^ 

A case of special interest to the Voyager mission is where we take ^j[*] in (23) 
to be non-existent. That is, we take i to be <^, indicating the empty set. Equation 23 
becomes 

= (24) 

^,k 0 k 

This will code m!I noiselessly with k bits/sample if it is a priori known that the most 
u n 

significant n-k bits of each Mq sample never change (i.e., remain equal to zero) . Observa 
that (24) further reduces to already defined unity code operator if we let k = n, 
that is 

n '^0 n 

\^ , [M J = L = yPJM ]. (25) 

©,n O n 3 0 

We will later define an adaptive coder, called the "Fast Compressor," based on (24). 
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Before continuing, it is worthwhile noting some other equivalences which occur 
under special conditions. If we take k = 0 in (23) we get 


i M 
^ 0,0 ' 




^ [•] 
^1.0^ ^ 




<f>,r\ 






^gl-l 


(261 




*4l-l 


^ [•] 
^ 5 , 0 ‘ ' 




(see Ref. 1). 


The reader familiar with Ref. 1 will also note that the defitiition of |^[»] is essen- 
tially the same as ^^[»] ®V '7 defined in Refs. 1-3. This new form allows more flex- 
ibility in new definitions to follow. 

OPERATOR 

We now define an adaptive operator which chooses between the split-sample 
modes |^[*1 just defined, or any other previously defined operators, ^ [•! coding of 
sequence's Mq takes the form 

(^ulM^I-iO-^lolM^l (271 

where the function of 'ID' is to identify which (previously defined) code operator is to be 
used to represent Mq. 

.j [•] takes on exactly the same form as the "Basic Compressor'' and is in 
fact equivaleit if 'ID' is restricted to the options 0, 1, 2 and 3. But the assumption in 
(27) is that 'ID' can identify any previously defined code operator, including in 

(23) or (24). As a binary prefix to \^IdIMSi in (27) 'ID' can be represented by a fixed 
length binary code. 
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IV. PERFORMANCE FOR SELECTED y., 

In this section we will investigate the performance impact of different sets of 
code options for adaptive code operator, ^ (•]. All performance graphs include appro- 
priate bit/sample cost for code option identifiers. 

OPTIONS USING ^ q [»] - 

As already noted, choosing options [•), (i.e., ID = 0, 1, 

2 or 3) makes \l/^ .j[»l equivalent to the Basic Compressor The performance of 

^^(•l was illustrated in Fig. 4, 

OPTIONS USING 1^4 AND ,^1.] 

By expanding the allowed code options to include split-sample mode.s ^4 q[»], 

^4 2^’^' 0^*^' ^5 1^*^' ^5 2^*^ operator .j I •! be- 

comes equivalent to originally defined In Refs. 1-3. The performance of such a 
-^l*] with eight options is shown in Fig. 9 using the same test set described earlier 
(photographs of the Voyager portion of this set were displayed in Fig. 5). 

Note that allowing options 4^^ q(*1. ^2 1 "^2 2^*^' ' T®^ another equiv- 

alence to 'Agi*l (with J = N) defined in Refs. 1-3. ^gl*l performance is roughly equiva- 
lent to '/'g[»l but is somewhat simpler. 

OPTIONS USING ONLY ^[.] AND 

Figure 9 illustrates that the originally defined '/'gl*l operators assure efficient per- 
formance over any entropies exceeding about 0.7 bits/sample. But the ^ol»l operators 

*^n k 

rely primarily on ^2l*l code the split most-significant bit sequences Mq' either as 
part of the Basic Compressor or directly in the case of '/'gl*!- We now investigate the 
sole use of the simpler fundamental sequence operator for that purpose. 

Assuming the desire for a fixed length binary representation for the 'ID' prefix in 
(27) we have two practical possibilities to investigate. A three-bit 'ID' allows the identi- 
fication of up to eight options whereas a two-bit 'ID' allows up to four. Consider the 
former case first. 


^ partitions a sequence into smaller blocks which are separately coded using ^4(»1. 
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ONE DIMENSIONAL ENTROPY, BITS/SAMPLE 


Fig. 9 . Average Performance, .jf.) = ^0(»1 with 
Eight Options: 1^5 kl»l. k = 0 , 1 , 2 , ... 6, 


Eight-Option Codes 

We include the following code operator options for a 


(*) having 8 options: 
< 28 ) 
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In actual coding we let the prefix ID in (27) take on the binary values 000, 001 .010,..., 
1 1 1 to correspond to the use of operators (.}, ^ respectivsiy. 

^r example, if chosen as the most efficient operator to code a sequence 

Mq, the output for .| (.] would take the form 

\^1i(K^qI = 101 * v^1 sIMqI (29) 

which from (23) breaks down further to 

'^IiIm") = 101 * L° * (30) 


The average performance fo> this selection of code options is shown in Fig. 1 0. 
Observe that the inclusi'^n of would have no impact on expected Voyager perfor- 
mance and could be deleted for that application. In general, the resulting graph, shown 
dashed, would intercept the ordinatp Jig*- y above 1 bit/sample. The replacement of 
^qI>] with a very high en+ opy split-sample mode g[«] would have negligible impact 
on the upper end of this graph for the test set used.’ 

Note that the performance graph f rr this fundamental sequence based is 

nearly identical to that for a version which relied on the availability of to code 
the split (most significant) samples. The only difference is a slight advantage at en- 
tropies approaching 6 bits/sample, certainly of no consequence to Voyager. 

Four-Option Codes 

With only four options to choose from it becomes more crucial to pick those op- 
tions to cover the entropy range of interest. For the Voyager and other similar imaging 

problems can be omitted in favor of a higher entropy option. We consider the latter 
case first where we choose options 

'^1^*1 = '^l.l^**' ‘^1,2^*^' ^3^*^ 
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Fig. 10. .j(»] Performance with Eight Options; 

4 '^ 1^*^' ■ ■ ■' ^1 5^*^ 9nd 


which use only as the basic variable length coder. As a means of comparison we 
also investigate the four options 

^^(•1. 4'2^*'i' 4'2 <32) 


whic'^ still includes the use of ^ performance comparison is shown in Fig. 1 1 . 
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Figure 1 1 shows that the two sets of options in (31 } and (32) yield very similar 
performance at the lower entropies. However, a growing but small advantage is indi- 
cated for the second set incorporating at entropies exceeding 4 bits/sample. The 
performance differential is about 0.3 bits/sample at 5.0 bits/sample entropy. 

Now exchange the high entropy options in (31 ) and (32) for operator vq[* 1. The 
resulting code option sets are then 

(33) 
and 

(34) 


We again recognize the second set as the options making up the Sasic Compressor, 
^ 4 (»j. Thuj we are now investigating whether a , [•!, choosing between the options 
in (33), can provide a simpler alternative to the Basic Compressor for the range of entro- 
pies from 0.7 to 4 bits/sample. A comparison of performance is shown in Fig. 1 2 where 
the Basic Compressor results are the same as in Fig. 4. 

Figure 1 2 again illustrates a slight advantage at higher entropies for a four option 
^ l»l which includes ^2^*1 ®^se the Basic Compressor). The graphs are almost 

replicas of those in Fig. 1 1 but shifted downward by 1 bit/sample. But note that Figs. 9 
and 1 0 illustrate that this advantage of incorporating ^ 2 (*I can be made negligible over 
most practical entropy ranges (e.g.. Voyager) by incorporating an increased number of 
split-sample options. 


DECISION CRITERIA 

The optimum decision criteria for any .j (.] configuration is to choose the option 
which yields the shortest coded sequence. That is 

Choose ID such that 


= min .SP(\^.IMq)) 
i 


(35) 
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where i is taken over the set of a priori selected options. A simplified decision rule can 
sometimes reduce computation requirements by replacing i?(^j[MQ]) by a more easily 
obtained estimate 7 |(Mq). Taking 

(36) 


the decision rule in (35) is replaced by 

Choose ID such that 

TiofM^) = min 7j(M"). (37) 

i 

Such estimates and decision criteria we'e developed for all code operators defined in 
Refs. 1 -3. in this section we will investigate simplified estimates for split-sample modes 

Expanding 

By (23), the number of bits required by split-sample mode is given by 

^[MqD = <?(L°) -h i?(\^^lMQ^). (38) 

Ttie first term represents N samples of k least significant bits and is thus given by 

= Nk. (39) 

The second term, conveniently denoted by 

= .Y’(»^^Im"'‘'i) (40) 

is simply the length of a fundamental sequence generated for the most significant n-k bit 
samples of Mq. We will investigate its relationships in the following paragraphs. 
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Let rrij denote the jth sample of preprocessed data sequence Mq so that 


= mi . • mg . 


. * m 


N' 


( 41 ) 


Each sample of Mq can be represented in standard binary form with n bits so that 


,n-1 _n-1 .n-2 „n-2 .0 

m. = b. *2 + b. • 2 + ... + b. 

J J J J 


( 42 ) 


where b^ ^ is the most significant bit and b? 
representation. 


is the least significant bit in this 


Using this notation end (14) we have 


N n-1 


Pq = N + L 


j=l 

o 

II 

N 

n-1 

F, 


j=1 

f=1 


( 43 ) 


N n-1 


F = N + 
k 




j=1 f=k 
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Afiei some manipulation we car* relate F. to F. _ - hv the expression 

K K I 


N 






( 44 ) 


j=1 


Expanding (44), any F|^ can be related to Fq (fundamental sequence length for the 
original Mq) by 


N k-1 


F, > N + 2 
k 


-k 




f ( 

b • 2 
J 


(45) 


j=l f=0 


where we recognize the last term inside the parenthesis as the sum of all the k-bit least 
significant samples making up Lj^. That is 

Sum of 
k bit 
samples 

• tO 

in L,^ 

Substituting (45) or (46) and (39) back into (38) yields an exact calculation for 

W, kIMSi)- 

Simplified Calculations 

Whereas (45) or (46) may reduce the implementation requirements for precise 
calculations of the ] Fj^ [ , significant further simplifications result if we are willing to ac- 
cept an estimate of the | Fj^ [ based on Fq and the assumption that the k least significant 
bits are completely random. It was the latter assumption that led us to the split sample 
modes in the first place. Then, if we assume for all j and f < k 

1 1 with probability 1 12 

b - (47) 

J 

V 0 with probability 1 12 




Fundamental 
sequence 
Length for 


= N -H 2 


-k 


M 


n,k 


/ 


- N - 
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and substitution in (45) yields 

where E|»} denotes statistical expectation. 

Adding Nk from (39) provides a practical estimate for 5’(^.j |^IMqJ) L (38) 






n-K N,. 
2 + - 1 
0 2 


2'S + Nk. 


(49) 


The latter estimate and (48) are shown in ? more convenient form in Table 3 for k < 5. 

By (37) the selection of code operator . [•] can be accomplished by choosing 
the k tor which 7 .J |^(Mq) is minimum. This results in the clear cut decision regions as 
shown in the rightmost column of Table 3. 

Test results indicated that ^.| .^ [•] performance using either or the simpler F|^ in 
Table 3 could be expected to be almost identical. The largest observed performance dif- 
ference for the eight-option code in (28) was 0.01 b/p. 


FAST compressor! 101 


Consider now an extremely simple and hence computationally "FAST" .^(»1 
based on the j^I*] options in (2^). By definition a j^(»] code operator is s'mply to 
represent all of input sequence Mq by L|^ (the sequence of k least significant bits). Since 
this process is reversible only if all the n-k most significant bits of Mq are fixed (e.g., 
zero), we say that (u.sing (42)) ^ , ^l^ol = X® is a valid noiseless representation if, for 
all j 


.n -1 .n -2 

b. = b. 

J J 


.n-k 

= b. = 0 . 


(50) 
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Table 3. and 7 ^ 


k 

(Eq. 48) 

(Eq. 491 

DECISION 

REGION 

0 

•^0 

•=0 

N < "c " I N 

1 

2Fq + N 

2Fq + 5N 

5 9 

Fq<-N 

4 

4 

2 

2Fq + 3N 

2Fq 4 19N 

9 17 

2 ^- Fo ■ yN 

8 

3 

3 

2Fq + 7N 

1 

2Fq f 55N 

1 _ 

y N • Fq ' y N 

ie 

16 

4 

2Fq + 15N 

2Fq + 143N 

33 K, r 65 

y N Fq • y N 

32 

32 

5 

1 

2Fq f 31N 

2Fq 4 351N 

j 

65 161 

y N ■ Fq ■ — N 

(for eight bit data) 

64 

64 


Choosing between yp^ i^f*] options using the optimum decision criteria in (35) nieans 

^0 + 
choosing the shortest Lj^ such that (50) is true. Then letting ' 


k 


# 




log (max 

^ j 


m.) I 
J -J 


(51) 


we choose k as the smallest of the allowed options such that k k* . If all k are allowed 
ihen k = k*. 


means the smallest integer greater than or equal to x. 
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Parformancti 


The performance of a FAST compressor using the eight options (•!, 2 l*J' 

g[ 1 = and an input block of N = 16 is shown in Fig. 1 3. The graph in- 

dicates performance which remains uniformly about 0.6 bits/sainple above the entropy 
line throughout the Voyager entropy range of interest. Its computational simplicity 
makes this algorithm the primary candidate for the Voyager Uranus encounter.^ 

FAST/FS 

Exchanging ^ I*] for fundamental sequence operator l»] in the above set of 
options leads to a noticeable improvement at lower entropies as illustrated in Fig. 1 3. 


^ Recent tests indicate that a FAST block size of N = 5 is the best choice for Voyager, 
providing an improved performance of 0.13 bits/sample. 
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ONE-DIMENSIONAL ENTROPY, H,, BITS/SAMPLE 

5 

Fig. 13. FAST and FAST/FS Performance. 
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V. GENERALIZED SPLIT-SAMPLE MODES: CODING FOR SPIKES 


The Voyager camera system generates reseau marks which appear as occa- 
sional, unusually large values of 8 to any of the ^l*] coding algorithms just discussed. 
These operators are generally protected against catastrophic behaviour by the inclusion 
of "back-up" operator Consequently the aggregate impact of these spikes 

averaged over an image is minor. However, the reseau marks are so arranged that an in- 
dividual line will tend to have either several such spikes or none at all. Further, the 
buffering limitations and data format structure on Voyager require fixed line rates so 
that the effect of these events on an individual line cannot be averaged out over suc- 
ceeding lines. This section investigates remedies to this type of situation. 

The general problem is characterized by symbol probability distributions like that 
in Fig. 14. 

Lower valued numbers exhibit the d».sired probability ordering in (1 ), Pq > p.j 2 
P 2 2 ... but, due to perhaps an independent data source such as reseau marks or noise 
spikes, one or more very large numbers occur with significantly higher probability than 
some of the smaller numbers. Thus condition (1) is not well approximated for all the 
possible input numbers. This preprocessing problem is dealt with here by generalizing 
the split-sample modes to allow the two distinct distributions in Fig. 1 4 to be dealt with 
independently. 

OPERATOR $$!['''*'(.] 

n' k' 

Consider now a second version of split-sample operations, SS^ ' [•] shown in 
Fig. 1 5 where we use the parameters n' and k' to emphasize the distinction from pre- 
vious definitions. The structure appears identical to that shown in Fig. 6 but the output 
sequences of least significant bit samples, and most significant bit samples, 
M'J , are defined differently. 

Starting again with a preprocessed data sequence of n' bit samples denoted 
Mq = m.j * m 2 * ... * mj^ as in (41), let 

j.| < l2 < is < < j|_ (52) 


oF TOOK ir<JAUrjC 
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Fig. 1 4. Effect of Occasional Spikes. 
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denote the location of all m| samples such that 


Of FOOfi Qi., 


: IS 

'■•Aury 


m. >2*^ - 1, f = 1. 2 L 


( 53 ) 


#s.* 

and 0 -s L s N. Now concatenate, in order of occurrence, ail the L samples of Mq 
which meet this condition and define the result by 


_n ,k 

^ M- = = nri; * m. * ... * m. . 

B,1 0 1 12 L 


(54) 


Now for j = 1 , 2, ..., N let 


+ 

m. = 
) 


2*^ - 1 


m. 


if mj 2 - 1 


Otherwise 


(55) 


and form the sequence 


_n',k' ~n' ~+ + + + 

®A,1 '“o'= '■k- •'"2 • ■■■"'n' 


(561 


Limiting Cases 


We note the limiting cases for Sg 1»| and 'j' (»| by 


n n n n n' 

Sg (Mq 1 = m" 


= (f> (i.e.. Empty set)^ 


n ,n 
A,1 


(M 


0 



= M 


n' 

0 


(57) 


M*j can actually be non-empty if and only if an m^ equals its maximum, 2^ - 1 . In 
most real situations this would happen with very low probability so that for all practical 
purposes (57) is true. 
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n' 0 n' O 


r.n',0 ~n', ~+ « ^ 

® A , l ‘“ o '°'-0 = 0 - 0 '-* 0 . 


158 ) 


Observe that can be retrieved from the two sequences Lt, and M? . All 
samples of Mq which are less than 2 - 1 nppear unchanged in L although they can 

be represented in fixed length form with only k' bits instead of the initial n'. Any sample 

'^ 4 - 

in L . , equal to 2 - 1 is interpreted as a "flag” for a decoder to look for the true n' bit 

K ^ _i_ 

value in the sequence following L Such n'-bit samples are unchanged from 

their values in M« and appear in their original order of occurrence. 


Example 

To illustrate these oparations in a less abstract manner, consider the N = 12 
sample sequence 


Mq = Mq = 0, 1, 5, 3, 2, 1, 0, 3, 127, 1, 3, 0. (59) 

Applying operator we get, by (55) and (56), the sequence of 3-bit numbers 

= 0, 1, 5, 3, 2, 1, 0, 3, 7, 1, 3, 0. (60) 

Q 

Observe that the ninth sample of Mq has been reduced from 1 27 to 7. Only this same 
ninth sample satisfies (53) where mg > 2^-1 = 7 so that by (54) 

= ID- = eight bit sample = 127. (61) 

D, I u I y 
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Further Processing of 

The samples of L"J^, take on the values 0, 1,2, ..., 2*^-1. Sample values less 
than 2*^ - 1 occur with the same probability as in Mq in^ijg. 1 4 and so by assumption 
exhibit the desired ordering in (1 ). However, a sample of LT, equals 2*^ - 1 if any sam- 

^ n* if* 

pie of Mq exceeds 2^-2 and may thus occur more frequently than some of the smaller 
numbers. This is illustrated in Fig. 16. 


Then 


Ic ^ ^ “1“ 

To correct this situation we specify the mapping [•] of L into Lj^, 
By(56),L|^ = m^ * m^ * ■■■ where the samples of are defined in (551 . 


"I" k^ 4" 

k = « IL 

k a k 


+ + + + 
m^ * m^ 


m 


+ + 

N 


(62) 



SYMBOL 'ALUES 


Fig. 16. Probability Distribution fr Samples of Lj^, . 
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where 
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] 

1 


i 


! and 



if m. =2 - 1 

) 

•t 

if m. < a 
J 

if a s m. <2 - 1 

J 





( 63 ) 


Optimally, a should bs chosen such that 

> Pr[m.'*' = i], i > a 
J 

( 64 ) 

< Pr[m.^ = i], i < a 
J 

to ensure condition (1 » for sequences of m|^ " (see fig. 1 6). However, in practice such 
precise knowledge ot data statistics 'S unlike'y to oe sufficient to ensure such optimal- 
ity. In some cases an adaptive update of a may be desirable. 

GENERALIZED SPLIT-SAMPLE CODER, 

^ (c n' k' 

Cascading the sp>ii-sample operations SSq [•] and SS., (• i provides the struc- 

n k k' 

ture for a generalized split-sample oprration SS ' ‘ [•] and an extended aet of noise- 
less code operators, structure is illustrated in fig. 1 7. Starting from the 

left, n-bit/sample input sequence Mq is first prccessea by the basic SSq'*^ 1»] operation 

(Fig. 6) to yield the sequence cl k least significant bits, l 9, and tf<e sequence of n’ - 

n k ^ 

n-k most significant bits, Mq' , which we relabel for fur (her processing as 

Mq^Mq*^. (65) 
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n ^ ic ^ 

The most significant bit samples are then split by SS^ ' [•] (Fig. 1 5) into a sequence of 
modified k' bit samples, and a variable length sequence of n' bit samples 
L^, is then further processed by 0^ [•] (82) into Lj^ Then we have 


n,k,k' ~n /~0 
SS (Mpl = 


a 


0 ~+ + r^n',k' 

Lk, .M, 


}• 


( 66 ) 


An extended class of noiseless coders for Mq is defined by 


ot n 4" n -k 


(67) 


where is any previously defined code operator. 

APPLICATION OF ,^.[.] TO VOYAGER 

Unlike random noise spikes, the effect of periodically placed reseau marks on pre- 
processed Mq distributions is both infrequent and local in natuie. That is, only those 
lines containing reseau marks will exhibit unusually high frequencies of occurrence of 
very large symoo! \ alues. More locally, this effect appears only in regions within lines 
where reseau marks occur. Statistically a Voyager image data source then appears as a 
normal image source for very long periods (lines where no reseaus appear) interrupted 
periodically by stretches (several lines) where statistical characteriotics alternate be- 
tween normal image data and image data containing spikes (see Fig. 14). 

General Operator Definitions 

First we extend the definition of \l/. -[•] to handle sequences of data blocks. Let 

•v I I 

Z = Z.J * Z 2 * ... * Z-|- be a sequence of T preprocesoed data bloc Us (i.e., any Zj can be 
imorpreted to be an Mq) and define ^ ^ 









*11 'V 


( 68 ) 


^ Observe that some applications might benefit from additional coding of M 


n',k' 

1 
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operation of applying a specific ^ operator individually to each Z- and 
then concatenating the results in order of occurrence.^ 2 IZ] might specify the coding 
of a complete line of image data for example. 


Now let 



[•1 


(69) 


be a code operator (see 27) selected to be used on data which does not contain 

spikes (e.g., the eight option .| (•] in 28). Then by (68) the corresponding operator for 
a sequence of many data blocks such as a complete line is 

<70) 

Next, let 

'^^^[•1 (71) 


be an operator formed from one or more of the options which are chosen to 

work effectively when data spikes are present. 


Now form a 2-option .jl*] called which chooses between and 

(Mq] takes the form 




1 = 


1 


^ ,a ,r^n, 
0 * \I/ [M ] 
^ir 0 

4 ,b ,r^n 

y * yl/ (Ml 

''^1 r 0 


(72) 


where the '0' or ' 1 ' identifies to a decoder whether .j [•) or respectively, was 

used to represent Mq. 


^This is much the same as the extension of ^^l*) to i/'gl*) and ^0[*1 in Ref. 1 . 
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Again by (68), the corresponding multiple block or full-line (for imaging) applica- 
tion of ^ follows as 

Finally, define operator 

(74) 

1 1 

as a two-option ^ [•] which chooses between 2 f*l 2^*^’ inaaging this de- 
cision is made on a line-by-line basis. allows the additional 1-bit per block 

overhead of -j [•) to be avoided when there are long stretches of data without spikes 
(i.e., the lines between reseau marks in imaging). 

Split-FS Options for Voyager 

For the Voyager imaging application we take [•] as made up of 

I eight-options in (28)| (75) 

and based on experimental evidence of symbol distributions in the regions where 

1^1 ^ 

reseaus occur, the assumed eight-option set forming ^.| .| [•) is with 

i - 1, a = 1 (76a) 

and the |k,k'} pairs 

10,2(, 10, 3t, |0,4l, 11, 2t, H,3|, ll,4l, 12,3(, 12,41. (76W 

Performance. Figure 1 8 illustrates the impact of 1 (*] coding on two images of 
distinctly different entropies from the Voyager test set in Fig. 5. The originals of these 
images are shown at the top and the results of applying both 2 l*l y 1*1 a 

fixed line rate constraint of 3.0 b/p are shown directly below. The white areas on the 
right side of each image represent pixels that are missing because the allotment of 
(3.0)(800) bits per tine had been used up before the complete line could be transmitted. 
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Under such a constraint the effect of reseau marks is to substantially increase the num- 
ber of missing pixels. A comparison of the images generated using 1*1 

reveals that f’l reduces the number of pixels caused by reseau marks by approxi- 
mately 50% with performance on non-reseau lines unchanged. 

A more detailed look at performance showed that .| [•] improved performance 
on the lower entropy image by: 

a) as much as 4 b/p, and an average of 1 b/p on individual 16 sample blocks 
containing reseau marks; 

b) as much as 0.3 b/p, and an average of 0.2 b/p on lines containing reseau 
marks. 

c) an average of 0.04 b/p over a complete image 
Absolute gains on the higher entmpy image were smaller. 


Removing noiseless constraint. A slight modification to the detailed definition of 
^^k k'^*^ ^ ^ permit the flagged samples of to be reduced in quantiza- 

tion. The result is to eliminate completely the remaining reseau-caused additional loss of 
data in Fig. 18. The necessary penalty is the acceptance of local quantization error on 
he reseau marks themselves. 

Decision process for ^ [•]. With the options chosen in (75) and (76) the .j [•] 
decision process can be accomplished by coding a line with .j (•! and saving the 1-bit 
block identifiers (see 72) as a single line header. If all block ID's are zero they can be 
discarded, since in this case, the line has been coded using only [•!. 


FAST Compressor for Spikes 

We now investigate computationally simpler options for .j (•], ^ [•], etc. 

Let of (69) be 

|The eight-option FAST compressor in (24), (50) and (51 ){ (77) 
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and for an eight-option \^° ^ | • ) in ( 7 1 ) we use with k = 0 and a = 2*^ - 1 and for 

various k' we choose i to denote selected (FAST Compressor) code options from (24). 
These choices greatly simplify the general diagram for in Fig. 1 7. With k = 0, 

the basic split-sample operation SSli'*^l*l is not performed so that M« directly enters 

n' k' k' k' 

SSy ' !•) with n = n'. Since a = 2 -1, the 0^ (•) mapping is not performed and 

simply passes its input on unchanged. The resulting structure is shown< in Fig. 1 9 where 

we have replaced with its designated form from (24), 

parameter k". Our eight-option is then defined by the following | k',k"| pairs 

{2,1|, 12,2}, }2,3{, }2,4{ (78a) 

and 

l3,l{ , 13,2}, {3,3|, j3,4}. (78b) 

Performance. A simplified ^l*! cod« operator in (74) is obtained by substitut- 
ing these simplified algorithms. Fig. 20 illustrates the result of this substitution. Using 
the same two images as in Fig. 1 8, the impact of this "modified FAST compressor" on 
reseau effects is displayed. Again, a fixed line rate of 3.0 b/p is assumed. The results of 
^ from Fig. 1 8 appear at the top as s reference instead of the original images. 
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Fig. 1 9. FAST Options for (•). 
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Fig. 20. Comparison of Roseau Effects Using 8-Option FAST and Modified FAST 

with Fixed Line Rate of 3.0 b/p. 
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The modified FAST ^ reduced the number of reseau-caused missing pixels 
by about Vs on both images. This is a lower percentage reduction than observed in Fig. 
1 8 with the more complex .j (•). Tnis is because the reseau effects on the basic FAST 
algorithm here) are more pronounced. The absolute gains are actually very 

close. 


As with the more complex ^ [•), a removal of the noiseless constraint on reseau 
regions could allow further reductions in reseau effects. 


51 



VI. ADDITIONAL IMPROVEMENTS FOR IMAGE DATA 
AND PERFORMANCE SUMMARY 


The test data used to evaluate algorithm performance in Sections il-V was 
exclusively derived from image data sources. However, by choosing to follow the 
optional preprocessing Structure A in Fig. 17 all 'performance vs. entropy results are 
generally applicable to data sources which can be made to appear locally memoryless 
and have symbol probability distributions approximating condition (1) or the unusual 
situation caused by noi' e spikes depicted in Fig. 1 4. Here we note improvements in ab- 
solute performance directly attributable to simple modifications of the basic "image" 
preprocessing functions: a) prediction, and b) the mapping of prediction error into the 
positive integers. 

TWO-DIMENSIONAL PREDICTOR 

All results generated thus far have been based on the basic one-dimensional pre- 
diction algorithm (and corresponding one-dimensional entropy) defined in Table 1 . Simp- 
ly replacing that predictor with the two-dimensional version in the same table can yield 
substantial gains In absolute performance at the higher entropy values as illustrated in 
Fig. 21.14] The graph plots two-dimensional entropy vs. one-dimensional entropy for 
the same image test set we have used throughout. Entropy reductions of as much as 
0.6 b/p can be expected. 

Coding performance relative to data entropy is independent of the predictor used 
since all the desired conditions for preprocessing are met in either case. Hence, absolute 
performance of a given coding algorithm can be expected to follow any reductions in en- 
tropy resulting from the use of two-dimensional prediction. Figure 21 thus depicts the 
real gains in performance which can be expected to be passed on to such algorithms as 
.j [•] in Fig. 1 0 or the FAST Compressor in Fig. 1 3. 

Adaptive Prediction 

The graphs in Fig. 21 display average results for complete images. In Voyager's 
case we found some unusual quirks in the test set which occasionally made the one- 
dimensional predictor significantly better than the two-dimensional predictor. Choosing 
between the two options on a line-by-l.:ie basis eliminated any problems. Further 
sophistication seemed unwarranted. 
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Fig. 21. Two-Dimensional vs. One-Dimensional Entropy. 

Pictorial Results 

The results of combining the generalized split-sample modes of Section V with 
two-dimensional adaptive prediction are illustrated in Fig. 22. For comparison purposes 
pictures from both Figs. 1 8 and 20 have been reproduced here. The top two images 
have been transferred from the middle row of images in Fig. 20 and represent the results 
of the basic FAST compressor using one-dimensional ( 1 -D) prediction. Recall that this is 
the algorithm currently implemented for the Voyager Uranus encounter. The second row 
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Fig. 22. Impact of 2-D Piediction. 


54 


0 ^ images has been transferred from the bottom of Fig. 1 8 (or the top of Fig. 20) and is 
the result of using the ^ [•] defined in (76) with one-dimensional prediction. The bot- 
tom pair of images represent the results of using this same operator in combination with 
two-dimensional (adaptive) prediction (2-D). Tne advantages of two-dimensional 
prediction are quite apparent, particularly on the high entropy image. 

IMPROVEMENTS FROM DYNAMIC RANGE CONSTRAINTS 

fi II prior performance results were based on the basic mapping of prediction 
errors, A, into the integers, t** in (3'. Replacing (3) with the more complex form in (6) 
takes into account dynamic range constraints. Test using both one and two- 
d-monsional predictors, indicated typica' performance imorovements of from 0.01 to 
0.03 b/p. There is no coding penalty in utilizing (6) under any condition. 




\ 
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